On the Integrality Gap of the Subtour LP for 

the 1,2-TSP 



Jiawei Qian^"^, Frans Schalekamp, David P. Williamson^**, and 
Anke van Zuylen^ 

^ School of Operations Research and Information Engineering, 
pq Corneh University, Ithaca, NY 14853, USA. 

7— H jq35@cornell.edu, dpwOcs . Cornell .edu 

f^ ^ Max-Planck-Institut fiir Informatik, Department 1: Algorithms and Complexity, 

04 Campus El 4, Room 311c, 66123 Saarbriicken, Germany. 

5^ anke@mpi-inf.mpg.de 

, Abstract. In this paper, we study the integrality gap of the subtour LP 

relaxation for the traveling salesman problem in the special case when all 
I I edge costs are either 1 or 2. For the general case of symmetric costs that 

^0 obey triangle inequality, a famous conjecture is that the integrality gap 

^*^ is 4/3. Little progress towards resolving this conjecture has been made 

in thirty years. We conjecture that when all edge costs dj G {1,2}, the 
integrality gap is 10/9. We show that this conjecture is true when the 
optimal subtour LP solution has a certain structure. Under a weaker 
assumption, which is an analog of a recent conjecture by Schalekamp, 
^^ Williamson and van Zuylen, we show that the integrality gap is at most 

7/6. When we do not make any assumptions on the structure of the 
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S^ optimal subtour LP solution, we can show that inegrahty gap is at most 

19/15 ~ 1.267 < 4/3; this is the first bound on the integrality gap of the 
subtour LP strictly less than 4/3 known for an interesting special case 
of the TSP. 
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1 Introduction 

The Traveling Salesman Problem (TSP) is one of the most well studied problems 
in combinatorial optimization. Given a set of cities {1, 2, . . . , n}, and distances 
c{i,j) for traveling from city i to j, the goal is to find a tour of minimum length 
that visits each city exactly once. An important special case of the TSP is the 
case when the distance forms a metric, i.e., c(z, j) < c{i, k) + c{k, j) for all z, j, k, 
and all distances are symmetric, i.e., c{i,j) = c{j,i) for all i,j. The symmetric 
TSP is known to be APX-hard, even if c{i,j) G {1, 2} for all i,j [TS]; note that 
such instances trivially obey the triangle inequality. 
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The symmetric TSP can be approximated to within a factor of | using an 
algorithm by Christofides [7 from 1976. The algorithm combines a minimum 
spanning tree with a matching on the odd-degree nodes to get an Eulerian graph 
that can be shortcut to a tour; the analysis shows that the minimum spanning 
tree and the matching cost no more than the optimal tour and half the optimal 
tour respectively. Better results are known for several special cases, but, sur- 
prisingly, no progress has been made on approximating the general symmetric 
TSP in more than thirty years. A natural direction for trying to obtain bet- 
ter approximation algorithms is to use linear programming. The following linear 
programming relaxation of the traveling salesman problem was used by Dantzig, 
Fulkerson, and Johnson [9] in 1954. For simplicity of notation, we let G = (V, E) 
be a complete undirected graph on n nodes. In the LP relaxation, we have a 
variable x{e) for all e = (*, j) that denotes whether we travel directly between 
cities i and j on our tour. Let c(e) — c{i,j), and let S{S) denote the set of all 
edges with exactly one endpoint in S C V. Then the relaxation is 

Min Y^ c{e)x{e) 

eeE 

(SUBT) subject to: ^ x{e) = 2, Vi G V, (1) 

eeS{i) 

J2 2;(e) > 2, VS" c V^, 3 < |5| < \V\ - 3, (2) 

e<£S{S) 

< x{e) < 1, Ve e E. (3) 

The first set of constraints (fTl) are called the degree constraints. The second 
set of constraints (pi) are sometimes called subtour elimination constraints or 
sometimes just subtour constraints, since they prevent solutions in which there 
is a subtour of just the nodes in S. As a result, the linear program is sometimes 
called the subtour LP. It has been shown by Wolsey [19] (and later Shmoys and 
Williamson [T7]) that Christofides' algorithm finds a tour of length at most | 
times the optimal value of the subtour LP; these proofs show that the minimum 
spanning tree and the matching on odd-degree nodes can be bounded above by 
the value of the subtour LP, and half the value of the subtour LP, respectively. 
This implies that the integrality gap, the worst case ratio of the length of an 
optimal tour divided by the optimal value of the LP, is at most |. However, no 
examples are known that show that the integrality gap can be as large as |; 
in fact, no examples are known for which the integrality gap is greater than 
|. A well known conjecture states that the integrality gap is indeed |; see (for 
example) Goemans [lO] . 

Recently, progress has been made in several directions, both in improving 
the best approximation guarantee and in determining the exact integrality gap 
of the subtour LP for certain special cases of the symmetric TSP. In the graph- 
TSP, the costs c{i,j) are equal to the shortest path distance in an underlying 
unweighted graph. Oveis Gharan, Saberi, and Singh TH show that the graph- 
TSP can be approximated to within | — e for a small constant e > 0. Boyd, 



Sitters, van der Ster and Stougie [6], and Aggarwal, Garg and Gupta Ij inde- 
pendently, give a I -approximation algorithm if the underlying graph is cubic. 
Momke and Svensson il2j improve these results by giving a 1.461-approximation 
for the graph-TSP and an |-approximation algorithm if the underlying graph is 
subcubic. Their results also imply upper bounds on the integrality gap of 1.461 
and I in these cases. Mucha [13] improves the analysis of this algorithm for 
graph-TSP to a bound of ^ . 

In Schalekamp, Williamson and van Zuylen [TB], three of the authors of this 
paper resolve a related conjecture. A 2-matching of a graph is a set of edges 
such that no edge appears twice and each node has degree two, i.e., it is an 
integer solution to the LP (SUBT) with only constraints (IT]) and (IS]). Note that 
a minimum-cost 2-matching thus provides a lower bound on the length of the 
optimal TSP tour. A minimum-cost 2-matching can be found in polynomial 
time using a reduction to a certain minimum-cost matching problem. Boyd and 
Carr [5] conjecture that the worst case ratio of the cost of a minimum-cost 2- 
matching and the optimal value of the subtour LP is at most -^. This conjecture 
was proved to be true by Schalekamp et al. and examples are known that show 
this result is tight. 

Unlike the techniques used to obtain better results for the graph-TSP, the 
techniques of Schalekamp et al. work on general weighted instances that are 
symmetric and obey the triangle inequality. However, their results only apply to 
2-matchings and it is not clear how to enforce global connectivity on the solution 
obtained by their method. A potential direction for progress on resolving the 
integrality gap for the subtour LP is a conjecture by Schalekamp et al. that the 
worst-case integrality gap is attained for instances for which the optimal subtour 
LP solution is a basic solution to the linear program obtained by dropping the 
subtour elimination constraints. 

In this paper, we turn our attention to the 1,2-TSP, where c{i,j) G {1, 2} for 
all i,j. Papadimitriou and Yannakakis [TS] show how to approximate 1,2-TSP 
within a factor of ^ starting with a minimum-cost 2-matching. In addition, they 
show a ratio of g with respect to the the minimum-cost 2-matching that has no 
cycles of length 3. Blaser and Ram [3j improve this ratio and the best known 
approximation factor of | is given by Berman and Karpinski [3] . 

We do not know a tight bound on the integrality gap of the subtour LP even 
in the case of the 1,2-TSP. As an upper bound, we appear to know only that the 
gap is at most | via Wolsey's result. There is an easy 9 city example showing 
that the gap must be at least ^i see Figure nl This example has been extended 
to a class of instances on 9fc nodes for any positive integer k by Williamson [18] . 
The contribution of this paper is to begin a study of the integrality gap of the 
1,2-TSP, and to improve our state of knowledge for the subtour LP in this case. 
We are able to give the first bound that is strictly less than | for these instances. 
This is the first bound on the integrality gap for the subtour LP with value less 
than I for a natural class of TSP instances. Under an analog of a conjecture of 
Schalekamp et al. [16], we show that the integrality gap is at most g, and with 





Fig. 1: Illustration of the worst example known for the integrality gap for the 
1,2-TSP. The figure on the left shows all edges of cost 1. The figure in the center 
gives the subtour LP solution, in which the dotted edges have value i, and the 
solid edges have value 1; this is also an optimal fractional 2-matching. The figure 
on the left gives the optimal tour and the optimal 2-matching. 



an additional assumption on the structure of the solution, we can improve this 
bound to ^. We describe these results in more detail below. 

We start by giving a boimd on the subtour LP in the general case of 1,2-TSP. 
All the known approximation algorithms since the initial work of Papadimitriou 
and Yannakakis ll5j on the problem start by computing a minimum-cost 2- 
matching. However, the example of Figure [l] shows that an optimal 2-matching 
can be as much as ^ times the value of the subtour LP for the 1,2-TSP, so we 
cannot directly replace the bound on the optimal solution in these approxima- 
tion algorithms with the subtour LP in the same way that Wolsey did with 
Christofides' algorithm in the general case. Using the result of Schalekamp, 
Williamson, and van Zuylen |16j and some additional work, we are able to show 
that an algorithm of Papadimitriou and Yannakakis [IB] obtains a bound on the 
subtour LP for the 1,2-TSP of|-f-h| = ^w 1.3086. 

Next, we show stronger results in some cases. A fractional 2-matching is a 
basic optimal solution to the LP [SUBT) with only constraints (IT]) and (Is]). 
Schalekamp et al. [TB] have conjectured that the worst-case integrality gap for 
the subtour LP is obtained when the optimal solution to the subtour LP is an 
extreme point of the fractional 2-matching polytope. We show that if this is 
the case for 1,2-TSP then we can find a tour of cost at most | the cost of the 
fractional 2-matching, implying that the integrality gap is at most ^ in these 
cases. We then show that if this optimal solution to the fractional 2-matching 
problem has a certain structure, then we can find a tour of cost at most ^ times 
the cost of the fractional 2-matching, implying that the integrality gap is at most 
^ in these cases. Figure 111 shows that this result is tight. The results above all 
lead to polynomial-time algorithms, though we do not state the exact running 
times. 

Finally, we prove that there exists a worst-case integrality gap instance for 
which the optimal value of the subtour LP is less than n + \, where n denotes the 
number of nodes. For such instances, we show that our previous arguments imply 
that one can construct a tour of cost at most ^ times the subtour LP value. 

15 

We anticipate that substantially stronger bounds on the integrality gap can be 
shown. In particular, we conjecture that the integrality gap is in fact exactly ^. 



We perforin computational experiments that show that this conjecture is true 
for n < 12. 

The remainder of this paper is structured as fohows. Section [2] contains pre- 
hminaries and a first general bound on the integrality gap for the 1,2-TSP. We 
show how to obtain stronger bounds if the optimal subtour LP solution is a 
fractional 2- matching in Section [3] In Section HI we combine the arguments from 
the previous sections and show that the integrality gap without any assumptions 
on the structure of the subtour LP solution is at most H . We describe our com- 
putational experiments in Section p\ Finally, we close with a conjecture on the 
integrality gap of the subtour LP for the 1,2-TSP in Section |6] 

2 Preliminaries and a first bound on the integrality gap 

We will work extensively with 2-matchings and fractional 2-matchings; that is, 
extreme points x of the LP [SUBT) with only constraints (fTl) and (pi, where 
in the first case the solutions are required to be integer. For convenience we 
will abbreviate "fractional 2-matching" by F2M and "2-matching" by 2M. The 
basic solutions of the F2M polytope have the following well-known structure 
(attributed to Balinski [2 ). Each connected component of the support graph 
(that is, the edges e for which x{e) > 0) is either a cycle on at least three nodes 
with x{e) ~ 1 for all edges e in the cycle, or consists of odd-sized cycles with 
x{e) = \ for all edges e in the cycle connected by paths of edges e with x{e) = 1 
for each edge e in the path (the center figure in Figure Il| is an example). We call 
the former components integer components and the latter fractional components. 
In a fractional component, we call a path of edges e with x{e) = 1 a 1-path. The 
edges e with x{e) = | in cycles are called cycle edges. An F2M with a single 
component is called connected, and we call a component 2- connected if the sum 
of the X- values on the edges crossing any cut is at least 2. We let n denote the 
number of nodes in an instance. 

As mentioned in the introduction, Schalekamp, Williamson, and van Zuylen |16j 
have recently shown the following. 

Theorem 1 (Schalekamp et al. [165). If ^dge costs obey the triangle inequal- 
ity, then the cost of an optimal 2-matching is at most ^ times the value of the 
subtour LP. 

It is not hard to show that this immediately implies an upper bound of | x ^ 
on the integrality gap of the subtour LP for the 1,2-TSP: we can just compute a 
minimum cost 2-matching, and remove the most expensive edge from each cycle, 
which gives a collection of node disjoint paths, which can be combined into a tour 
of cost at most | x ^ times the value of the subtour LP. The following theorem 
states that applying an algorithm by Papadimitriou and Yannakakis [15] to the 



2-matching will produce a tour of cost at most ^ times the value of the subtour 



Theorem 2. The integrality gap of the subtour LP is at most ^^ for 1,2-TSP. 



Proof. We show that there exists a tour of cost at most | times the cost of 
the optimal 2M plus | times the value of the subtour LP if all edge costs are 
either 1 or 2. Using Theorem [TJ this implies that the tour has cost at most 
g X ^ + I = ^ times the value of the subtour LP. 

Papadimitriou and Yannakakis [TS] observe that we can assume without loss 
of generality that the optimal 2M solution consists of a number of cycles with 
only edges of cost 1 ( "pure" cycles) and at most one cycle which has one or more 
edges of cost 2 (the "non-pure" cycle). Moreover, if i is a node in the non-pure 
cycle which is incident on an edge of cost 2 in the cycle, then there can be no 
edge of cost 1 connecting i to a node in a pure cycle (since otherwise, we can 
merge the non-pure cycle with a pure cycle without increasing the cost). 

The Papadimitriou- Yannakakis algorithm solves the following bipartite match- 
ing problem; On one side we have a node for every pure cycle, and on the other 
side, we have a node for every node in the instance. There is an edge from pure 
cycle C to node i, ii i ^ C and there is an edge of cost 1 from i to some node in 
C. Let r be the number of pure cycles that are unmatched in the maximum car- 
dinality bipartite matching. Papadimitriou and Yannakakis show how to "patch 
together" the matched cycles, and finally how to combine the resulting cycles 
into a tour of cost at most ^0PT(2Af ) + |n -h |r, where 0PT{2M) is the cost 
of an optimal 2M. We claim that 

OPT{SUBT) >n + r, 

where OPT{SUBT) denotes the cost of the optimal subtour LP solution. Note 
that, combined with the analysis of Papadimitriou and Yannakakis and Theorem 
fl} this imphcs that the cost of the tour is then at most | • ^OPT{SUBT) + 
^OPT{SUBT). 

To prove the claim, we note that for a bipartite matching instance, the size 
of the minimum cardinality vertex cover is equal to the size of the maximum 
matching. We use this fact to construct a feasible dual solution to the subtour 
LP that has value n + r. Let Cm, Vm be the pure cycles and nodes (in the original 
graph) , for which the corresponding nodes in the bipartite matching instance are 
in the minimum cardinality vertex cover. The dual of the subtour LP (SUBT) 
is 





Max2^y(5) + 2^y(z)-^z(e) 




SCV ieV ee_E 


{D) subject to: 


)^ y{S) + yii) + y{j) - z{e) < c(e), Ve = I 




SCV:eeS{S) 




viS) > 0, \/ScV,3<\S\<n 




z{e) > 0, Ve e E. 



We set z{e) — for each e £ E, and we set y{i) — ^ for each i e V\Vm- For 
a pure cycle on a set of nodes C, we set y{C) = ^, if the cycle is not in Cm- 
Note that the dual objective for this solution is exactly n + r. It remains to show 
that the dual constructed is feasible. Define the load on an edge e = (i,j) of 



solution {y, z) to be J2sGV:e€S{s} vi^) + vi^) + 2^0) ~ ^(s)- For any edge of cost 
1 inside a cycle of the 2M, the load on the edge is at most 1. For an edge (i, j) 
where i £ C and j € C ^ C, the load is at most 2. Suppose {i,j) has cost 1, 
and the cycles C and C" are both pure cycles. Then the edge occurs twice in 
the bipartite matching instance (namely, once going from ? to C and once going 
from j to C") and hence the dual of at least two of the four objects i,j, C and 
C" has been reduced to 0. The total load on edge {i,j) is thus at most 1. Now, 
suppose C" is the non-pure cycle, then yc = 0, since we only increased the dual 
variables for the pure cycles. Moreover, at least one endpoint of the (j, C) edge 
in the bipartite matching instance must be in the vertex cover, so the load on 
edge {i,j) is again at most 1. D 

D 

We note that the bound obtained on the integrality gap seems rather weak, 
as the best known lower bound on the integrality gap is only ^. Schalekamp, 
Williamson, and van Zuylen |16j have conjectured that the integrality gap (or 
worst-case ratio) of the subtour LP occurs when the solution to the subtour 
LP is a fractional 2-matching. That is, the worst-case ratio for the subtour LP 
occurs for costs c such that an optimal subtour LP solution for c is the same 
as an optimal fractional 2-matching for c. Schalekamp et al. call such costs c 
fractional 2-matching costs for the subtour LP. 

Conjecture 1 (Schalekamp et al. [16j) The integrality gap for the subtour 
LP is attained for a fractional 2-matching cost for the subtour LP. 

In the next section, we show that we can obtain better bounds on the inte- 
grality gap of the subtour LP in the case that the optimal solution is a fractional 
2-matching. In Section [4] we show how to combine the proof of Theorem [2] with 
the bounds in the next section, to obtain a better upper bound on the integrality 
gap. 

3 Better bounds if the optimal solution is an F2M 

If the optimal solution to the subtour LP is a fractional 2-matching, then a 
natural approach to obtaining a good tour is to start with the edges with cost 1 
and a;- value 1, and add as many edges of cost 1 and x- value | as possible, without 
creating a cycle on a subset of the nodes. In other words, we will propose an 
algorithm that creates an acyclic spanning subgraph (V, T) where all nodes have 
degree one or two. We will call an acyclic spanning subgraph in which all nodes 
have degree 1 or 2 a partial tour. A partial tour can be extended to a tour by 
adding d/2 edges of cost 2, where d is the number of degree 1 nodes. The cost 
of the tour is c{T) -\- d, where c{T) = X^esT ^(^)- 
We will use the following lemma. 

Lemma 1. Let G — (V,T) be a partial tour. Let A be a set of edges not in T 
that form an odd cycle or a path on V C V , where the nodes in V' have degree 
one in T. We can find A' C A such that {V, T U A) is a partial tour, and 



- I^'l > ^\A\ if A is a cycle, 

- \A'\ > 1{\A\ ~1) if A is a path, 

We postpone the proof of the lemma and first prove the imphcation for the 
bound on the integrahty gap if the optimal subtour LP solution is a fractional 
2-niatching. 

Theorem 3. There exists a tour of cost at most | times the cost of a connected 
F2M solution if c{i,j) G {1,2} for all i,j. 

Proof. Let P — {e ^ E : x{e) — 1} (the edges in the 1-paths of x). We will 
start the algorithm with T = P. Let R—{e^E: x{e) — i and c(e) — 1} (the 
edges of cost 1 in the cycles of x). Note that the connected components of the 
graph (y, R) consist of paths and odd cycles. The main idea is that we consider 
these components one by one, and use Lemma [l] to show that we can add a large 
number of the edges of each path and cycle, where we keep as an invariant that 
T satisfies the conditions of the lemma. Note that by Lemma [T] the number of 
edges added from each path or cycle A is at least |A|/3, except for the paths for 
which \A\ = 1 (mod 3). Let Vi be this set of paths. We would like to claim that 
we add a third of the edges from each component, and we therefore preprocess 
the paths in Vi, where we add one edge (either the first or last edge from each 
path in Vi) to T if this is possible without creating a cycle in T, and if so, 
we remove this edge and its neighboring edge in R (if any) from R. After the 
preprocessing, we use Lemma [I] to process each of the components in {V, R) . 

We call a path A in Vi "eared" if the 1-paths that are incident on the first 
and last node of the path are such that they go between two neighboring nodes 
of A. We claim that we can add an edge from at least half of the paths in Vi 
that are not eared: If we cannot add either the first or the last edge from a path 
A in Pi, and A is not eared, then it must be the case that either the first or the 
last edge forms a cycle with an edge that was added to T from another path in 
Vi- We remove the edges that were added to T, and their neighboring edge (if 
any) from R. 

We now iterate through the connected components in {V, R) and add edges 
to T while maintaining that T is a partial tour. By Lemma IT] the number of 
edges added from each path or cycle A is at least |j4|/3, except for the paths in 
Vi ■ Note that for a path A in Vi that is not eared, and for which we had already 
added an edge to T in the preprocessing step, will have added a total of at least 
1 + {\A\ — 2 — l)/3 — I A|/3 edges. For a path in Vi for which we did not add an 
edge to T in the preprocessing stage, we have added at least {\A\ — l)/3 edges. 
Now, recall that a path A in Vi has |yl| = 1 (mod 3), and that the number of 
edges added is an integer, so in the first case, the number of edges added is at 
least |A|/3 + I and in the second case it is |A|/3 — |. 

Let z be the number of eared paths in Vi . Then, the total number of edges 
from R added can be lower bounded by \\R\ — \z. We now give an upper bound 
on the number of nodes of degree one in T. 

Let k be the number of cycle nodes in x, i.e. k — #{i e V : x{i,j) — 
i for some j S V}, and let p be the number of cycle edges of cost 2 in x, i.e. 



p ~ #{e G E : x{e) ~ ^ and c(e) = 2}. Note that p > z, since R contains p paths 
on the k cycle nodes. Initially, when T contains only the edges in the 1-paths, all 
k nodes have degree one, and there are k—p edges in R. We argued that we added 
at least ||i?| — |z= ^k— ^p— ^z edges to T. Each edge reduces the number of 
nodes of degree one by two, and hence, the number of nodes of degree one at the 
end of the algorithm is at most k — 2(|fc— |p— |-z) — ■^k+^p+ |z. Recall that 
c(P) denotes the cost of the 1-paths, and the total cost of T at the end of the 
algorithm is equal to c{P) + ^fc— |p— ^z. Since at most |fc+ |p+ 1^ nodes have 
degree one in T, we can extend T into a tour of cost at most c{P) + ^k+-^p+^z. 
The cost of the solution x can be expressed as c{P) + ^k + ^p. Note that 
each 1-path connects two cycle nodes, hence c{P) > -^k. Moreover, an eared 
path A is incident to one (if |A| = 1) or two (if \A\ > 1) 1-paths of length 
two, since the support graph of x is simple. Therefore we can lower bound c{P) 
by ifc + z. Therefore, ^ {c{P) + |fc + ^p) > c{P) + j^k + \z + ^k + ^p > 
c{P) + |fc + 1-2 + kp, where p > z is used in the last inequality. D 

Proof of Lemma^ The basic idea behind the proof of the lemma is the following: 
We go through the edges of A in order, and try to add them to T if this does 
not create a cycle or node of degree three in T. If we cannot add an edge, we 
simply skip the edge and continue to the next edge. Since the edges in T form 
a collection of disjoint paths and each node in A has degree one in T, we can 
always add either the first edge or the second edge of A: if the first edge cannot 
be added, then adding it to T must create a cycle, and since the edges in T form 
a collection of node disjoint paths, adding the second edge of the path or cycle 
to T cannot create a cycle. Similarly, we need to skip at most two edges between 
two edges that are successfully added to T: first, an edge is skipped because 
otherwise we create a node of degree three in T, and if a second edge is skipped, 
then this must be because adding that edge to T would create a cycle. But then, 
adding the next edge on the path cannot create a cycle in T. Hence, the number 
of edges from we can add from each path or cycle A is at least {\A\ — l)/3, if A 
is a path, and [|^|/3J , if ^ is a cycle, where \A\ denotes the number of edges in 
A. 

We now show that by being a little more careful, we can in fact add \A\/3 
edges if j4 is a cycle. Note that the number of nodes in A is odd, and hence there 
must be some j such that the path in T that starts in Uj ends in some node 
V ^ A. We claim that if we consider the edges in A starting with either edge 
{uj-i,Uj} or edge {uj,Uj+i}, we are guaranteed that for at least one of these 
starting points, we can add both the first and the third edge to T. 

Clearly, neither {uj^i,Uj} nor {uj,Uj+i} can create a cycle if we add it to 
T. So suppose that T U {uj^i,Uj} U {uj+i,Uj+2} contains a cycle. This cycle 
does not contain the node Uj, because the path in T that starts in Uj ends in 
some node v ^ C . Hence T contains a path that starts in u^+i and ends in Uj+2- 
But then T U {uj,Uj+i\ U {uj+2,Uj"+3} does not have a cycle, since if it did, 
T must have a path starting in Uj+2 and ending in mj+3 which is only possible 
if Wj+i = Uj+3. Since the number of nodes in A is at least three, this is not 
possible. D 



We remark that the ratio of | in Theorem pi is achieved if every 1-path 
contains just one edge of cost 1, and all cycle edges have cost 1. However, in 
such a case, we could find another optimal F2M solution by removing the 1-path 
with endpoints in two different odd cycles of edges with x{e) = ^, increasing 
the a;- value on the four cycle edges incident on its endpoints to 1, and then 
alternating between setting the x-value to and 1 around the cycles. Now, since 
the cycles are odd, the degree constraints are again satisfied. The objective value 
does not increase because we only change the x-value on edges of cost 1. 

We will call an F2M solution canonical, if there exists no 1-path of cost 1, for 
which the endpoints are in different odd cycles, and the four cycle edges incident 
on the endpoints all have cost 1. 

If the F2M solution is canonical and 2-connected, we can improve the analysis 
in Theorem [3] to show the following. 

Theorem 4. There exists a tour of cost at most ^ times the cost of a 2- 
connected canonical F2M solution if c{i,j) G {1, 2} for all i,j. 

Proof. We adapt the final paragraph of the proof of Theorem [3j As before, the 
cost of the tour is at most c{P) -I- |fc + |p+ ^z. However, since the F2M solution 
X is 2-connected, z = 0: if there is a 1-path connecting two nodes connected by a 
cycle edge {i, j}, then {i,j} defines a cut in x with only two cycle edges crossing 
the cut, and hence x is not 2-connected. 

The cost of the F2M solution is c{P) + \k + ^p, and by the fact that x is 
canonical, we have that c{P) > k — 2p. The proof is concluded by noting that 
then f {c{P) + ifc + ip) > c{P) -I- ifc - |p + |fc + |p = |fc + ip + c{P). D 



4 A better upper bound on the integrality gap 

We now show how to use the results in the previous two sections, to obtain a 
general upper bound that is better than the bound given in Section [21 

Note that in order to bound the integrality gap of the solution obtained by 
the Papadimitriou-Yannakakis algorithm, we need to (i) bound the difference 
between the cost of the 2M solution and the subtour LP, and (ii) bound the 
difference between the 2M solution and the tour constructed from it by the 
Papadimitriou-Yannakakis algorithm. 

As in the proof of Theorem [5] we call a cycle in a 2M solution a "pure" cycle 
if all its edges have cost 1, and a "non-pure" cycle otherwise. The idea behind 
this section is to show that the quantity in (i) can be "charged" to the nodes in 
the non-pure cycles only, and that the quantity in (ii) can be "charged" mainly 
to the nodes in the pure cycles. 

We first state the following lemma, which formalizes the second statement. 

Lemma 2. IfOPT{SUBT) < n+1, then the difference between the 2M solution 
and the tour constructed by the Papadimitriou-Yannakakis algorithm is at most 
jsTipure + jQ^non-pure, whcrc Upure *■? the number of nodes in pure cycles in the 
2M solution, and n„on-pure 'is the number of nodes in non-pure cycles. 



Proof, (sketch) Note that the proof of Theorem [2] and the assumption that 
OPT{SUBT) < n + 1 imply that the Papadimitriou-Yannakakis algorithm finds 
a bipartite matching that matches all the pure cycles. A careful look at the 
analysis of Papadimitriou and Yannakakis [15; then shows that their algorithm 
finds a tour which satisfies the lemma: Papadimitriou and Yannakakis [15j show 
that we can charge the additional cost incurred for patching together the cy- 
cles to the nodes involved, in such a way that each node is charged at most „. 
Note that | < ^ , so the charge to the nodes in pure cycles satisfies the lemma. 
Papadimitriou and Yannakakis distinguish three ways of merging cycles, and it 
is easily seen from their analysis that there is only one case (called case (b) in 
their analysis) in which nodes that are in a non-pure cycle can get charged. In 
that case, there is an increase in cost of 1, that is distributed over the nodes of a 
pure cycle (at least three) and two nodes that may be in a non-pure cycle. This 
cost of 1 is the only charge that needs to be distributed over these five nodes. 
Since we allow the nodes in the pure cycles to be charged jf rather than just 
|, we indeed only need to charge j^ to the two nodes that are in the non-pure 
cycle. D 

We now show that we can indeed restrict our attention to instances with 
OPT{SUBT) < n + 1, the requirement of Lemma [2J 

Lemma 3. The worst-case integrality gap is attained on an instance with sub- 
tour LP value less than n -\- 1, where n is the number of nodes in the instance. 

Proof. Consider an instance I on n nodes for which the ratio between the length 
of the optimal tour and the subtour LP value is 7, and suppose OPT{SUBT) = 
n -\- k for some k > 1. We construct an instance /' with n' = n -\- I nodes, for 
which the ratio between the length of the optimal tour and the subtour LP value 
is at least 7, and the optimal value of the subtour LP is at most n+k = n' + k—l. 
Repeatedly applying this procedure proves the lemma. 

If OPT{SUBT) = n-\- k, then the subtour LP solution on / has a total 
x-value of k on edges of cost 2, since the objective value is equal to J^eeE ^(^) + 
EeG-E:c(e)=2 ^^(e), and J^eeE ^^(e) = 5 J2vev Y^eesiv) 2;(e) = n. We fix an optimal 
subtour solution x, and we construct /' from /, by adding one node i, and adding 
edges {i,j) of cost 1, for every j in / that is incident on an edge e with c(e) = 2 
and x{e) > 0. All other edges incident on i get cost 2. Note that the optimal 
tour on /' has length at least the length of the optimal tour on /, since we can 
take a tour on /' and shortcut i to obtain a tour on /. On the other hand, we 
can use x to define a feasible solution on /', by "rerouting" one unit in total 
from edges e = (j, k) with c(e) = 2 to the edges (j, i) and (i, k). Since the cost 
of this solution on /' is the same as the cost of x, the ratio between the length 
of the optimal tour and the subtour LP value has not decreased. D 

Theorem 5. The integrality gap of the subtour LP is at most y| for the 1,2- 
TSP. 

Proof. By Lemmalsj we can assume without loss of generality that OPT{SUBT) 
< n + 1. We first find an optimal F2M solution, and use Theorem [3] to convert 



n 


6 


7 


8 


9 


10 


11 


12 


Subtour IP/LP ratio 
# graphs 


8/7.5 
56 


8/7.5 
468 


9/8.5 
7,123 


10/9 
194,066 


11/10 
9,743,542 


12/11 
900,969,091 


13/12 



Table 1: The subtour LP integrality gap for 1,2-TSP for 6 < n < 12, along with 
the number of nonisomorphic biconnected graphs for 6 < n < 11. 



each component of the F2M solution into a cycle on the nodes in the component. 
Note that the F2M problem is a relaxation of the subtour LP, and, since the 
F2M solution is half-integral, its objective value is either n + -^ or n. Let Up^re 
and rinon-purc be defined as in Lemma [2J where the cycles are those returned by 
applying Theorem[3]to each connected fractional component of the F2M solution. 
By Theorem J3j the total cost of the resulting 2-matching is at most ripure + 

g'T'non-pure + J^^i the F2M SOlutioU haS cost n+^ and at most Upuro + g '^non-pure 

if the F2M solution has cost n. Combining this with Lemma[2] there exists a tour 

+ (| + 17)K 



'■puro"ri,g"rYo;"-non-pure"rY2'^ ^ W^ ) ^ ["re-l- 5 > 'T- + 1)) 

^6 ' io;"'non-pure, if 0PT(S't/i3r) e [ri,n+i). In either 
case, the cost of 'the tour is at most ]^OPT{SUBT). " D 



of cost at most n+j^n 

and at most n+j^np„rc + il + jQ)n 



5 Computational results 

In the case of the 1,2-TSP, for a fixed n we can generate all instances as follows. 
For each value of n, we first generate all nonisomorphic graphs on n nodes using 
the software package NAUTY [TT] . We let the cost of edges be one for all edges 
in G and let the cost of all other edges be two. Then each of the generated graph 
G gives us an instance of 1,2-TSP problem with n nodes, and this covers all 
instances of the 1,2-TSP for size n up to isomorphism. 

In fact, we can do slightly better by only generating biconnected graphs. We 
say that a graph G — (V, E) is biconnected if it is connected and there is no 
vertex v (£ V such that removing v disconnects the graph; such a vertex u is a 
cut vertex. It is easy to see that the subtour LP value is at least n -I- 1 if G is not 
biconnected, hence, by Lemma [3] it suffices to consider biconnected graphs. 

For each instance of size n, we solve the subtour LP and the corresponding 
integer program using CPLEX 12.1 '8] and a Macintosh laptop computer with 
dual core 2GIIz processor and 1GB of memory. It is known that the integrality 
gap is 1 for n < 5, so we only consider problems of size n > 6. The results are 
summarized in Table [l] For n = 11, the number of nonisomorphic biconnected 
graphs is nearly a billion and thus too large to consider, so we turn to another 
approach. For n = 11 and n = 12, we use the fact that we know a lower bound 
on the integrality gap of ^^, namely for the instances depicted in Figure pi The 
claimed lower bounds on the integrality gap for these instances follow readily 
from the integrality gap for the example in Figure [l] We then check whether 
this is the worst integrality gap for each vertex of subtour LP. A list of non- 
isomorphic vertices of the subtour LP is available for n = 6 to 12 at Sylvia 




Fig. 2: Illustration of the instances with integrality gap at least ^j for n = 11 
(without the grey node) and y| for 71 = 12 (with the grey node) for the 1,2-TSP. 
All edges of cost 1 are shown. 



Boyd's website http://www.site.uottawa.ca/~sylvia/subtourvertices, In 
order to check whether the lower bound on the integrality gap is tight, we solve 
the following integer programming problem for each vertex x of the polytope for 
n = 11 and n = 12, where now the costs c(e) are the decision variables, and x is 
fixed: 



Max z — an /^ c(e)x(e) 
subject to: 



eG-E 



y Ce > z, V tours T, 

e<£T 

Cee{l,2}, \fe£E. 

Note that a„ is the lower bound on the integrality gap for instances of n nodes. 
If the objective is nonpositive for all of the vertices of the subtour LP, then we 
know that a„ is the integrality gap for a particular value of n. 

Since the number of non-isomorphic tours of n nodes is (n — 1)!/2, the number 
of constraints is too large for CPLEX for n = 11 or 12. We overcome this difficulty 
by first solving the problem with only tours that have at least n—1 edges in the 
support graph of the vertex x, and repeatedly adding additional violated tours. 
We find that the worst case integrality gap for n = 11 is jy and for n = 12 is y|. 



6 Conjectures and conclusions 

As stated in the introduction, we conjecture the following. 

Conjecture 2 The integrality gap of the subtour LP for the 1,2-TSP is ^. 

Schalekamp, Williamson, and van Zuylen [16j have conjectured that the in- 
tegrality gap (or worst-case ratio) of the subtour LP occurs when the solution to 
the subtour LP is a fractional 2-matching. We have shown in Theorem |3] that if 
an analogous conjecture is true for 1,2-TSP, then the integrality gap for 1,2-TSP 
is at most I; it would be nice to show that if the analogous conjecture is true 
for 1,2-TSP then the integrality gap is at most ^. 
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